Efficient Monte Carlo Methods for Conditional Logistic Regression
نویسندگان
چکیده
Exact inference for the logistic regression model is based on generating the permutation distribution of the sufficient statistics for the regression parameters of interest conditional on the sufficient statistics for the remaining (nuisance) parameters. Despite the availability of fast numerical algorithms for the exact computations, there are numerous instances where a data set is too large to be analyzed by the exact methods, yet too sparse or unbalanced for the maximum likelihood approach to be reliable. What is needed is a Monte Carlo alternative to the exact conditional approach which can bridge the gap between the exact and asymptotic methods of inference. The problem is technically hard because conventional Monte Carlo methods lead to massive rejection of samples that do not satisfy the linear integer constraints of the conditional distribution. We propose a network sampling approach to the Monte Carlo problem that eliminates rejection entirely. Its advantages over alternative saddlepoint and Markov Chain Monte Carlo approaches are also discussed.
منابع مشابه
Lattice Points, Contingency Tables, and Sampling
Markov chains and sequential importance sampling (SIS) are described as two leading sampling methods for Monte Carlo computations in exact conditional inference on discrete data in contingency tables. Examples are explained from genotype data analysis, graphical models, and logistic regression. A new Markov chain and implementation of SIS are described for logistic regression.
متن کاملApproximate conditional inference in mixed-effects models with binary data
1 Summary Conditional likelihood approach is a sensible choice for a hierarchical logistic regression model or other generalized regression models with binary data. However, its heavy computational burden limits its use, especially for the related mixed effects model. In this paper, we use modified profile likelihood as an accurate approximation to conditional likelihood, and then propose the u...
متن کاملMonte Carlo error in the Bayesian estimation of risk ratios using log-binomial regression models: an efficient MCMC method
In cohort studies binary outcomes are very often analyzed by logistic regression. However, it is well-known that when the goal is to estimate a risk ratio, the logistic regression is inappropriate if the outcome is common. In these cases, a log-binomial regression model is preferable. On the other hand, the estimation of the regression coefficients of the log-binomial model is difficult due to ...
متن کاملComparison of two Markov chain Monte Carlo (MCMC) methods
As the world advances, statisticians/mathematicians are being involved into more and more complex surveys for the development of society and human beings. Consequently, these complex survey data requires complicated and high-dimensional models for final analysis. We need sophisticated and efficient statistical/mathematical tools for estimation and prediction of these models. Frequently, we simu...
متن کاملBuilding Detection Using Aerial Images and Digital Surface Models
In this paper a method for building detection in aerial images based on variational inference of logistic regression is proposed. It consists of three steps. In order to characterize the appearances of buildings in aerial images, an effective bag-of-Words (BoW) method is applied for feature extraction in the first step. In the second step, a classifier of logistic regression is learned using th...
متن کامل